Abstract

Blind deconvolution (BD), the resolution of a signal and a filter given their convolution, arises in many 
applications. Without further constraints, BD is ill-posed. In practice, subspace or sparsity constraints have 
been imposed to reduce the search space, and have shown some empirical success. However, existing theoretical 
analysis on uniqueness in BD is rather limited. As an effort to address the still mysterious question, we derive 
sufficient conditions under which two vectors can be uniquely identified from their circular convolution, subject 
to subspace or sparsity constraints. These sufficient conditions provide the first algebraic sample complexities 
for BD. We first derive a sufficient condition that applies to almost all bases or frames. For blind deconvolution
of vectors in $\mathbb{C}^n$, with two subspace constraints of dimensions $m_1$ and $m_2$, the required sample complexity is
$n \ge m_1 m_2$. Then we impose a sub-band structure on one basis, and derive a sufficient condition that involves
a relaxed sample complexity $n \ge m_1 + m_2 - 1$, which we show to be optimal. We present the extensions of
these results to BD with sparsity constraints or mixed constraints, with the sparsity level replacing the subspace 
dimension. The cost for the unknown support in this case is an extra factor of 2 in the sample complexity. 

Index terms — uniqueness, bilinear inverse problem, channel identification, equalization, multipath channel 

1 Introduction 

Blind deconvolution (BD) is the bilinear inverse problem of recovering the signal and the filter simultaneously
given their convolution or circular convolution. It arises in many applications, including blind image deblurring [2],
blind channel equalization [3], speech dereverberation [4], and seismic data analysis [5]. Without further
constraints, BD is an ill-posed problem, and does not yield a unique solution. A variety of constraints have been 
introduced to exploit the properties of natural signals and reduce the search space. Examples of such constraints 
include positivity (the signals are non-negative), subspace constraint (the signals reside in a lower-dimensional 
subspace) and sparsity (the signals are sparse over some dictionary). In this paper, we focus on subspace or 
sparsity constraints, which can be imposed on both the signal and the filter. Consider the example of blind
image deblurring: a natural image can be considered sparse over a wavelet dictionary or the discrete cosine
transform (DCT) dictionary. The support of the point spread function (PSF) is usually significantly smaller than 
the image itself. Therefore the filter resides in a lower-dimensional subspace. These priors serve as constraints 
or regularizers [6-10]. With a reduced search space, BD can be better-posed. However, despite the success in
practice, the theoretical results on the uniqueness in BD with a subspace or sparsity constraint are limited. 

Early works on the identifiability in blind deconvolution studied multichannel blind deconvolution with finite
impulse response (FIR) models [11, 12], in which sparsity was not considered. For single channel blind
deconvolution, sparsity was imposed as a prior without theoretical justification [6-9, 13].


Recently, recasting bilinear or quadratic inverse problems, such as blind deconvolution [10] and phase retrieval
[14], as rank-1 matrix recovery problems by “lifting” has attracted a lot of attention. Choudhary and Mitra [15]
adopted the lifting framework and showed that the identifiability in BD (or any bilinear inverse problem) hinges
on the set of rank-2 matrices in a certain nullspace. In particular, they showed a negative result that the
solution to blind deconvolution with a canonical sparsity prior, that is, sparsity over the natural basis, is not
identifiable [16]. However, the identifiability of signals that are sparse over other dictionaries has not been
analyzed. 

Using the lifting framework, Ahmed et al. [10] showed that BD with subspace constraints is identifiable up
to scaling. More specifically, if the signal subspace follows a random Gaussian model, and the filter subspace
satisfies some coherence conditions, convex programming was shown to recover the signal and the filter up to
scaling with high probability, when the dimensions of the subspaces $m_1$ and $m_2$ are in a near optimal regime
$m_1 + m_2 = O(n)$, where $n$ denotes the length of the signal. Ling and Strohmer [17] extended the model in [10]
to blind deconvolution with mixed constraints: the signal is sparse over a random Gaussian dictionary or a
randomly subsampled partial Fourier matrix, and the filter resides in a subspace that satisfies some coherence
condition. They showed that the signal and the filter can be simultaneously identified with high probability
using $\ell_1$ norm minimization (instead of nuclear norm minimization as in [10]) when the sparsity level $s_1$ and the
subspace dimension $m_2$ satisfy $s_1 m_2 = O(n)$. Lee et al. [18] further extended the model to blind deconvolution
with sparsity constraints on both the signal and the filter, and showed successful recovery with high probability
using alternating minimization when the sparsity levels $s_1$ and $s_2$ satisfy $s_1 + s_2 = O(n)$. A common drawback
of these works is that the probabilistic assumptions on the bases or frames are very limiting in practice. On 
the positive side, these identifiability results are constructive, being demonstrated by establishing performance 
guarantees of algorithms. However, these guarantees too are shown only in some probabilistic sense. 

To overcome the limitations of the lifting framework, the authors of this paper introduced a more general 
framework for the identifiability in bilinear inverse problems [19], namely, identifiability up to transformation
groups. We showed that two vectors x, y are identifiable up to a transformation group given their image under 
a bilinear mapping if: 


1. $x$ is identifiable up to the transformation group;

2. once x is identified, the recovery of y is unique. 

For multichannel blind deconvolution, we were able to derive identifiability results [19] under subspace, joint
sparsity or sparsity constraints within our framework.

In this paper, we address the identifiability in single channel blind deconvolution up to scaling under subspace 
or sparsity constraints. We present the first algebraic sample complexities for BD with fully deterministic signal 
models. The rest of the paper is organized as follows. We formally state the problem setup in Section 2. In
Section 3, we derive sufficient conditions for BD with generic bases or frames, using the lifting framework. In
Section 4, we derive much less demanding sufficient conditions for BD with a sub-band structured basis, using
the framework in [19]. Notably, the sample complexities of the sufficient conditions in this case match those
of corresponding necessary conditions, and hence are optimal. We conclude the paper in Section 5 with some
remarks and open questions. 

2 Problem Statement 

2.1 Notations 

We state the notations that will be used throughout the paper. We use lower-case letters $x$, $y$, $z$ to denote
vectors, and upper-case letters $D$ and $E$ to denote matrices. We use $I_n$ to denote the identity matrix and $F_n$ to
denote the normalized discrete Fourier transform (DFT) matrix. The DFT of the vector $x \in \mathbb{C}^n$ is denoted by
$\tilde{x} = F_n x$. We use $1_{m,n}$ to denote a matrix whose entries are all ones and $0_{m,n}$ to denote a matrix whose entries
are all zeros. The subscripts stand for the dimensions of these matrices. We say that a vector is non-vanishing if
all its entries are nonzero. Unless otherwise stated, all vectors are column vectors. The dimensions of all vectors
and matrices are made clear in the context.

The projection operator onto a subspace $\mathcal{V}$ is denoted by $P_{\mathcal{V}}$. The nullspace and the range space of a linear
operator are denoted by $\mathcal{N}(\cdot)$ and $\mathcal{R}(\cdot)$, respectively. We use $\Omega_X$, $\Omega_Y$ to denote constraint sets. The Cartesian
product of two sets is denoted by $\Omega_X \times \Omega_Y$. The pair $(x, y) \in \Omega_X \times \Omega_Y$ represents an element of the Cartesian
product. We use $./$ and $\odot$ to denote entrywise division and entrywise product, respectively. Circular convolution
is denoted by $\circledast$. Kronecker product is denoted by $\otimes$. The direct sum of two subspaces is denoted by $\oplus$.

We use $j$, $k$ to denote indices, and $J$, $K$ to denote index sets. If the universal index set is $\{1, 2, \cdots, n\}$, then
$J$, $K$ are subsets. We use $|J|$ to denote the cardinality of $J$. We use $J^c$ to denote the complement of $J$. We
use superscripts in parentheses to denote subvectors or submatrices. For example, $x^{(J)}$ represents the subvector of $x$
consisting of the entries indexed by $J$. The scalar $x^{(j)}$ represents the $j$th entry of $x$. The submatrix $D^{(J,K)}$ has
size $|J| \times |K|$ and consists of the entries indexed by $J \times K$. The vector $D^{(:,k)}$ represents the $k$th column of the
matrix $D$. The colon notation is borrowed from MATLAB.

We say a property holds for almost all signals (generic signals) if the property holds for all signals but a set
of measure zero.

2.2 Blind Deconvolution


In this paper, we study the blind deconvolution problem with the circular convolution model. It is the joint
recovery of two vectors $u_0 \in \mathbb{C}^n$ and $v_0 \in \mathbb{C}^n$, namely the signal and the filter$^1$, given their circular convolution
$z = u_0 \circledast v_0$, subject to subspace or sparsity constraints. The constraint sets $\Omega_U$ and $\Omega_V$ are subsets of $\mathbb{C}^n$.

find $(u, v)$,
s.t. $u \circledast v = z$,
$u \in \Omega_U$, $v \in \Omega_V$.


We consider the following scenarios: 

1. (Subspace Constraints) The signal $u$ and the filter $v$ reside in lower-dimensional subspaces spanned by the
columns of $D \in \mathbb{C}^{n \times m_1}$ and $E \in \mathbb{C}^{n \times m_2}$, respectively. The matrices $D$ and $E$ have full column ranks.
Therefore,

$\Omega_U = \{u \in \mathbb{C}^n : u = Dx \text{ for some } x \in \mathbb{C}^{m_1}\}$,

$\Omega_V = \{v \in \mathbb{C}^n : v = Ey \text{ for some } y \in \mathbb{C}^{m_2}\}$.

2. (Sparsity Constraints) The signal $u$ and the filter $v$ are sparse over given dictionaries formed by the columns
of $D \in \mathbb{C}^{n \times m_1}$ and $E \in \mathbb{C}^{n \times m_2}$, with sparsity levels $s_1$ and $s_2$, respectively. The matrices $D$ and $E$ are
bases or frames that satisfy the spark condition [20]: the spark, namely the smallest number of columns
that are linearly dependent, of $D$ (resp. $E$) is greater than $2s_1$ (resp. $2s_2$). Therefore,

$\Omega_U = \{u \in \mathbb{C}^n : u = Dx \text{ for some } x \in \mathbb{C}^{m_1} \text{ s.t. } \|x\|_0 \le s_1\}$,

$\Omega_V = \{v \in \mathbb{C}^n : v = Ey \text{ for some } y \in \mathbb{C}^{m_2} \text{ s.t. } \|y\|_0 \le s_2\}$.

3. (Mixed Constraints) The signal $u$ is sparse over a given dictionary $D \in \mathbb{C}^{n \times m_1}$, and the filter $v$ resides in
a lower-dimensional subspace spanned by the columns of $E \in \mathbb{C}^{n \times m_2}$. The matrix $D$ satisfies the spark
condition, and $E$ has full column rank. Therefore,

$\Omega_U = \{u \in \mathbb{C}^n : u = Dx \text{ for some } x \in \mathbb{C}^{m_1} \text{ s.t. } \|x\|_0 \le s_1\}$,

$\Omega_V = \{v \in \mathbb{C}^n : v = Ey \text{ for some } y \in \mathbb{C}^{m_2}\}$.

In all three scenarios, the vectors $x$, $y$, and $z$ reside in Euclidean spaces $\mathbb{C}^{m_1}$, $\mathbb{C}^{m_2}$ and $\mathbb{C}^n$. With the
representations $u = Dx$ and $v = Ey$, it is easy to verify that $z = u \circledast v = (Dx) \circledast (Ey)$ is a bilinear function of
$x$ and $y$. Given the measurement $z = (Dx_0) \circledast (Ey_0)$, the blind deconvolution problem can be rewritten in the
following form:

(BD) find $(x, y)$,
s.t. $(Dx) \circledast (Ey) = z$,
$x \in \Omega_X$, $y \in \Omega_Y$.

$^1$ Due to symmetry, the names “signal” and “filter” can be used interchangeably.

If $D$ and $E$ satisfy the full column rank condition or spark condition, then the uniqueness of $(u, v)$ is equivalent
to the uniqueness of $(x, y)$. For simplicity, we will discuss problem (BD) from now on. The constraint sets $\Omega_X$
and $\Omega_Y$ depend on the constraints on the signal and the filter. For subspace constraints, $\Omega_X = \mathbb{C}^{m_1}$, $\Omega_Y = \mathbb{C}^{m_2}$.
For sparsity constraints, $\Omega_X = \{x \in \mathbb{C}^{m_1} : \|x\|_0 \le s_1\}$, $\Omega_Y = \{y \in \mathbb{C}^{m_2} : \|y\|_0 \le s_2\}$.

2.3 Identifiability up to Scaling 

An important question concerning the blind deconvolution problem is to determine when it admits a unique
solution. The BD problem suffers from scaling ambiguity. For any nonzero scalar $\sigma \in \mathbb{C}$ such that $\sigma x_0 \in \Omega_X$
and $\frac{1}{\sigma} y_0 \in \Omega_Y$, $(D(\sigma x_0)) \circledast (E(\frac{1}{\sigma} y_0)) = (Dx_0) \circledast (Ey_0) = z$. Therefore, BD does not yield a unique solution if
$\Omega_X$, $\Omega_Y$ contain such scaled versions of $x_0$, $y_0$. Any valid definition of unique recovery in BD must address this
issue. If every solution $(x, y)$ is a scaled version of $(x_0, y_0)$, then we say $(x_0, y_0)$ can be uniquely identified
up to scaling. We define identifiability as follows.

Definition 2.1. For the constrained BD problem, the solution $(x_0, y_0)$, in which $x_0 \ne 0$ and $y_0 \ne 0$, is said to
be identifiable up to scaling, if every solution $(x, y) \in \Omega_X \times \Omega_Y$ satisfies $x = \sigma x_0$ and $y = \frac{1}{\sigma} y_0$ for some nonzero scalar $\sigma$.
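
The scaling ambiguity that motivates Definition 2.1 can be checked numerically. The following is a minimal sketch in plain Python; the helper `circ_conv` and the test vectors `u0`, `v0` and scalar `sigma` are illustrative assumptions, not part of the paper.

```python
def circ_conv(u, v):
    """Circular convolution of two equal-length sequences (direct O(n^2) sum)."""
    n = len(u)
    return [sum(u[j] * v[(i - j) % n] for j in range(n)) for i in range(n)]

# Hypothetical test vectors; any complex entries would do.
u0 = [1 + 2j, -1, 0.5j, 3]
v0 = [2, 1j, -1, 0.5]
sigma = 2 - 1j  # any nonzero complex scalar

z = circ_conv(u0, v0)
z_scaled = circ_conv([sigma * a for a in u0], [b / sigma for b in v0])

# The pair (sigma * u0, v0 / sigma) produces the same measurement as (u0, v0),
# up to floating-point rounding.
max_diff = max(abs(a - b) for a, b in zip(z, z_scaled))
print(max_diff)
```

Since the two pairs are indistinguishable from the measurement alone, uniqueness can only be asked for up to the scalar `sigma`.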

For blind deconvolution, there exists a linear operator $\mathcal{G}_{DE} : \mathbb{C}^{m_1 \times m_2} \to \mathbb{C}^n$ such that $\mathcal{G}_{DE}(xy^T) = (Dx) \circledast (Ey)$.
Given the measurement $z = \mathcal{G}_{DE}(x_0 y_0^T) = (Dx_0) \circledast (Ey_0)$, one can recast the BD problem as the recovery
of the rank-1 matrix $M_0 = x_0 y_0^T \in \Omega_M = \{xy^T : x \in \Omega_X, y \in \Omega_Y\}$. The uniqueness of $M_0$ is equivalent to the
identifiability of $(x_0, y_0)$ up to scaling. This procedure is called “lifting”.

(Lifted BD) find $M$,
s.t. $\mathcal{G}_{DE}(M) = z$,
$M \in \Omega_M$.
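
As an illustration of lifting, the sketch below builds a matrix representation of the linear operator column by column and checks that applying it to vec(x y^T) reproduces (Dx) circularly convolved with (Ey). The dimensions, the random seed, and the helper names (`circ_conv`, `matvec`, `column`, `rand_c`) are assumptions made for this example only.

```python
import random

def circ_conv(u, v):
    """Circular convolution of two equal-length sequences."""
    n = len(u)
    return [sum(u[j] * v[(i - j) % n] for j in range(n)) for i in range(n)]

def matvec(A, x):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def column(A, j):
    return [row[j] for row in A]

rng = random.Random(0)

def rand_c():
    return complex(rng.gauss(0, 1), rng.gauss(0, 1))

n, m1, m2 = 7, 2, 3  # hypothetical sizes
D = [[rand_c() for _ in range(m1)] for _ in range(n)]
E = [[rand_c() for _ in range(m2)] for _ in range(n)]
x = [rand_c() for _ in range(m1)]
y = [rand_c() for _ in range(m2)]

# Column (k*m1 + j) of G is D[:, j] circularly convolved with E[:, k],
# matching vec(M), which stacks the columns of the m1-by-m2 matrix M.
G_cols = [circ_conv(column(D, j), column(E, k))
          for k in range(m2) for j in range(m1)]
G = [[G_cols[c][i] for c in range(m1 * m2)] for i in range(n)]

vec_M = [x[j] * y[k] for k in range(m2) for j in range(m1)]  # vec(x y^T)

lifted = matvec(G, vec_M)                        # linear in the lifted variable
direct = circ_conv(matvec(D, x), matvec(E, y))   # bilinear in (x, y)
lift_err = max(abs(a - b) for a, b in zip(lifted, direct))
print(lift_err)
```

The bilinear measurement in (x, y) becomes a linear measurement of the rank-1 matrix x y^T, which is what makes nullspace arguments applicable.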


It was shown [15] that the lifted BD has a unique solution for every $M_0 \in \Omega_M$ if the nullspace of $\mathcal{G}_{DE}$ does not
contain the difference of two different matrices in $\Omega_M$.

Proposition 2.2. The pair $(x_0, y_0) \in \Omega_X \times \Omega_Y$ ($x_0 \ne 0$, $y_0 \ne 0$) is identifiable up to scaling in (BD), or
equivalently, the solution $M_0 = x_0 y_0^T \in \Omega_M$ is unique in (Lifted BD), if and only if

$$\mathcal{N}(\mathcal{G}_{DE}) \cap \{M_0 - M : M \in \Omega_M\} = \{0\}.$$

Proposition 2.2 is difficult to apply because it is not clear how to find the nullspace of the structured linear
operator $\mathcal{G}_{DE}$. To overcome this limitation, in our previous work (see [19, Theorem 2.8]), we derived a necessary
and sufficient condition for the identifiability in a bilinear inverse problem up to a transformation group. As a
special case, we have the following necessary and sufficient condition for the identifiability in BD up to scaling,
which holds for any $\Omega_X$ and $\Omega_Y$.

Proposition 2.3. The pair $(x_0, y_0) \in \Omega_X \times \Omega_Y$ ($x_0 \ne 0$, $y_0 \ne 0$) is identifiable up to scaling in (BD) if and
only if the following two conditions are met:

1. If there exists $(x, y) \in \Omega_X \times \Omega_Y$ such that $(Dx) \circledast (Ey) = (Dx_0) \circledast (Ey_0)$, then $x = \sigma x_0$ for some nonzero
$\sigma \in \mathbb{C}$.

2. If there exists $y \in \Omega_Y$ such that $(Dx_0) \circledast (Ey) = (Dx_0) \circledast (Ey_0)$, then $y = y_0$.

Propositions 2.2 and 2.3 are two equivalent conditions for the identifiability in blind deconvolution. Proposition 2.2
shows how the identifiability of $(x, y)$ is connected to that of the lifted variable $xy^T$. Proposition 2.3
shows how the identifiability of $(x, y)$ can be divided into the identifiability of $x$ and $y$ individually. In this paper,
we derive more readily interpretable conditions for the uniqueness of solution to BD with subspace or sparsity
constraints. We first derive a sufficient condition for the case where the bases or frames are generic, using the
lifting framework. We also apply Proposition 2.3 and derive a sufficient condition for the case where one of the bases has a
sub-band structure.

3 Blind Deconvolution with Generic Bases or Frames 


Subspace membership and sparsity have been used as priors in blind deconvolution for a long time. Previous
works either use these priors without theoretical justification [6-9, 13], or impose probabilistic models and show
successful recovery with high probability [10, 17, 18]. In this section, we derive sufficient conditions for the
identifiability of blind deconvolution under subspace or sparsity constraints. These conditions are fully deterministic
and provide uniform upper bounds for the sample complexities of blind deconvolution with almost all bases or
frames.

The identifiability of $(x_0, y_0)$ up to scaling in (BD) is equivalent to the uniqueness of $M_0 = x_0 y_0^T$ in (Lifted
BD). The linear operator $\mathcal{G}_{DE}$ can also be represented by a matrix $G_{DE} \in \mathbb{C}^{n \times m_1 m_2}$ such that $\mathcal{G}_{DE}(M_0) =
G_{DE} \mathrm{vec}(M_0)$, where $\mathrm{vec}(M_0)$ stacks the columns of $M_0 \in \mathbb{C}^{m_1 \times m_2}$ on top of one another and forms a vector
in $\mathbb{C}^{m_1 m_2}$. The columns of $G_{DE}$ have the form $D^{(:,j)} \circledast E^{(:,k)} = \sqrt{n} F_n^* (\tilde{D}^{(:,j)} \odot \tilde{E}^{(:,k)})$, where $\tilde{D} = F_n D$ and
$\tilde{E} = F_n E$. Clearly, the matrix $G_{DE}$ is a function of the matrices $D$ and $E$. It has the following properties (see
Appendix A for the proofs).
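
The column formula above rests on the convolution theorem for the normalized DFT, which carries a factor of the square root of n. A small numerical check, written as a sketch in plain Python with hypothetical vectors `d` and `e` standing in for one column of each matrix:

```python
import cmath
import math

def dft(x):
    """Normalized DFT, i.e. multiplication by F_n with entries exp(-2*pi*i*t*f/n)/sqrt(n)."""
    n = len(x)
    w = -2j * cmath.pi / n
    return [sum(x[t] * cmath.exp(w * t * f) for t in range(n)) / math.sqrt(n)
            for f in range(n)]

def circ_conv(u, v):
    """Circular convolution of two equal-length sequences."""
    n = len(u)
    return [sum(u[j] * v[(i - j) % n] for j in range(n)) for i in range(n)]

d = [1, 2 - 1j, 0, -3j, 0.5]  # stands in for a column D[:, j]
e = [2j, -1, 1, 0, 4]         # stands in for a column E[:, k]
n = len(d)

# F_n (d conv e) should equal sqrt(n) * (F_n d) entrywise-times (F_n e).
lhs = dft(circ_conv(d, e))
rhs = [math.sqrt(n) * a * b for a, b in zip(dft(d), dft(e))]
dft_err = max(abs(a - b) for a, b in zip(lhs, rhs))
print(dft_err)
```

Applying the inverse transform to both sides gives the stated expression for the columns in the time domain.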


Lemma 3.1. If $n \ge m_1 m_2$, then for almost all $D \in \mathbb{C}^{n \times m_1}$ and $E \in \mathbb{C}^{n \times m_2}$, the matrix $G_{DE}$ has full column
rank.

Lemma 3.2. If $n \ge 2 s_1 m_2$, then for any $0 \le t_1 \le s_1$, for almost all $D_0 \in \mathbb{C}^{n \times t_1}$, $D_1 \in \mathbb{C}^{n \times (s_1 - t_1)}$, $D_2 \in
\mathbb{C}^{n \times (s_1 - t_1)}$, and $E \in \mathbb{C}^{n \times m_2}$, the columns of $G_{D_0 E}$, $G_{D_1 E}$, $G_{D_2 E}$ together form a linearly independent set.

Lemma 3.3. If $n \ge 2 s_1 s_2$, then for any $0 \le t_1 \le s_1$, $0 \le t_2 \le s_2$, for almost all $D_0 \in \mathbb{C}^{n \times t_1}$, $D_1 \in \mathbb{C}^{n \times (s_1 - t_1)}$,
$D_2 \in \mathbb{C}^{n \times (s_1 - t_1)}$, $E_0 \in \mathbb{C}^{n \times t_2}$, $E_1 \in \mathbb{C}^{n \times (s_2 - t_2)}$, and $E_2 \in \mathbb{C}^{n \times (s_2 - t_2)}$, the columns of $G_{D_0 E_0}$, $G_{D_1 E_0}$, $G_{D_2 E_0}$,
$G_{D_0 E_1}$, $G_{D_1 E_1}$, $G_{D_0 E_2}$, $G_{D_2 E_2}$ together form a linearly independent set.

Next, we state and prove sufficient conditions for identifiability of blind deconvolution with generic bases or 
frames. 

Theorem 3.4 (Subspace Constraints). In (BD) with subspace constraints, $(x_0, y_0) \in \mathbb{C}^{m_1} \times \mathbb{C}^{m_2}$ ($x_0 \ne 0$,
$y_0 \ne 0$) is identifiable up to scaling, for almost all $D \in \mathbb{C}^{n \times m_1}$ and $E \in \mathbb{C}^{n \times m_2}$, if $n \ge m_1 m_2$.

Proof. By Lemma 3.1, if $n \ge m_1 m_2$, for almost all $D \in \mathbb{C}^{n \times m_1}$ and $E \in \mathbb{C}^{n \times m_2}$, the matrix $G_{DE}$ has full column
rank. Therefore, $\mathcal{N}(\mathcal{G}_{DE}) = \{0\}$, and the lifted problem has a unique solution. It follows that every pair $(x_0, y_0)$
is identifiable up to scaling. □
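
The argument can be tried out numerically in the smallest interesting case, where n equals the product of the two subspace dimensions and the lifted system is square. The sketch below draws random Gaussian matrices as a stand-in for "almost all" bases, solves the lifted linear system by Gaussian elimination, and compares the solution with vec(x0 y0^T). All names, sizes and the random seed are assumptions for illustration; this is not the authors' code.

```python
import random

def circ_conv(u, v):
    n = len(u)
    return [sum(u[j] * v[(i - j) % n] for j in range(n)) for i in range(n)]

def matvec(A, x):
    return [sum(a * b for a, b in zip(row, x)) for row in A]

def solve(A, b):
    """Solve the square system A t = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    aug = [list(row) + [rhs] for row, rhs in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[p] = aug[p], aug[c]
        for r in range(c + 1, n):
            f = aug[r][c] / aug[c][c]
            for k in range(c, n + 1):
                aug[r][k] -= f * aug[c][k]
    t = [0j] * n
    for r in reversed(range(n)):
        s = sum(aug[r][k] * t[k] for k in range(r + 1, n))
        t[r] = (aug[r][n] - s) / aug[r][r]
    return t

rng = random.Random(1)

def rand_c():
    return complex(rng.gauss(0, 1), rng.gauss(0, 1))

m1, m2 = 2, 3
n = m1 * m2  # smallest n covered by the sufficient condition
D = [[rand_c() for _ in range(m1)] for _ in range(n)]
E = [[rand_c() for _ in range(m2)] for _ in range(n)]
x0 = [rand_c() for _ in range(m1)]
y0 = [rand_c() for _ in range(m2)]

# Matrix of the lifted operator: column (k*m1 + j) is D[:, j] conv E[:, k].
cols = [circ_conv([row[j] for row in D], [row[k] for row in E])
        for k in range(m2) for j in range(m1)]
G = [[cols[c][i] for c in range(n)] for i in range(n)]

z = circ_conv(matvec(D, x0), matvec(E, y0))
m_hat = solve(G, z)  # recovered vec(M)
m_true = [x0[j] * y0[k] for k in range(m2) for j in range(m1)]
rec_err = max(abs(a - b) for a, b in zip(m_hat, m_true))

# The first column of M equals y0[0] * x0, so it is collinear with x0.
x_hat = m_hat[:m1]
collinear_err = abs(x_hat[0] * x0[1] - x_hat[1] * x0[0])
print(rec_err, collinear_err)
```

The recovered first column is a scaled copy of x0, in line with identifiability up to scaling rather than exact recovery of the pair.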

Theorem 3.5 (Mixed Constraints). In (BD) with mixed constraints, $(x_0, y_0) \in \mathbb{C}^{m_1} \times \mathbb{C}^{m_2}$ ($\|x_0\|_0 \le s_1$, $x_0 \ne 0$,
$y_0 \ne 0$) is identifiable up to scaling, for almost all $D \in \mathbb{C}^{n \times m_1}$ and $E \in \mathbb{C}^{n \times m_2}$, if $n \ge 2 s_1 m_2$.

Proof. Fix index sets $J_0, J \subset \{1, 2, \cdots, m_1\}$, for which $|J_0| = |J| = s_1$ and $|J_0 \cap J| = t_1$. Let

$$D_0 = D^{(:, J_0 \cap J)} \in \mathbb{C}^{n \times t_1}, \quad D_1 = D^{(:, J_0 \setminus J)} \in \mathbb{C}^{n \times (s_1 - t_1)}, \quad D_2 = D^{(:, J \setminus J_0)} \in \mathbb{C}^{n \times (s_1 - t_1)}.$$

By Lemma 3.2, if $n \ge 2 s_1 m_2$, then for almost all $D$ and $E$, the columns of $G_{D_0 E}$, $G_{D_1 E}$, $G_{D_2 E}$ together form a
linearly independent set. For every $(x_0, y_0)$ and $(x, y)$ such that the $s_1$-sparse $x_0$ and $x$ are supported on $J_0$ and
$J$ respectively, if $\mathcal{G}_{DE}(x_0 y_0^T) = \mathcal{G}_{DE}(xy^T)$, then

$$G_{D_0 E} v_0 + G_{D_1 E} v_1 + G_{D_2 E} v_2 = G_{DE} v = \mathcal{G}_{DE}(x_0 y_0^T) - \mathcal{G}_{DE}(xy^T) = 0,$$

where $v = \mathrm{vec}(x_0 y_0^T - xy^T)$, $v_0 = \mathrm{vec}(x_0^{(J_0 \cap J)} y_0^T - x^{(J_0 \cap J)} y^T)$, $v_1 = \mathrm{vec}(x_0^{(J_0 \setminus J)} y_0^T)$ and $v_2 = \mathrm{vec}(-x^{(J \setminus J_0)} y^T)$.
By linear independence, the vectors $v_0$, $v_1$, $v_2$ are all zero vectors, and so is $v$. Hence for almost all $D$ and $E$, and
all pairs $(x_0, y_0)$ and $(x, y)$ such that $x_0$ and $x$ are supported on $J_0$ and $J$ respectively, if $\mathcal{G}_{DE}(x_0 y_0^T) = \mathcal{G}_{DE}(xy^T)$,
then $x_0 y_0^T = xy^T$. Note that $J_0$ and $J$ are arbitrary, and there is only a finite number ($\binom{m_1}{s_1}^2$) of combinations
of $J_0$, $J$. Therefore, for almost all $D$ and $E$, every pair $(x_0, y_0)$ ($\|x_0\|_0 \le s_1$, $x_0 \ne 0$, $y_0 \ne 0$) is identifiable up to
scaling. □

Theorem 3.6 (Sparsity Constraints). In (BD) with sparsity constraints, $(x_0, y_0) \in \mathbb{C}^{m_1} \times \mathbb{C}^{m_2}$ ($\|x_0\|_0 \le s_1$,
$\|y_0\|_0 \le s_2$, $x_0 \ne 0$, $y_0 \ne 0$) is identifiable up to scaling, for almost all $D \in \mathbb{C}^{n \times m_1}$ and $E \in \mathbb{C}^{n \times m_2}$, if $n \ge 2 s_1 s_2$.

Proof. Fix index sets $J_0, J \subset \{1, 2, \cdots, m_1\}$, for which $|J_0| = |J| = s_1$ and $|J_0 \cap J| = t_1$, and index sets
$K_0, K \subset \{1, 2, \cdots, m_2\}$, for which $|K_0| = |K| = s_2$ and $|K_0 \cap K| = t_2$. Let

$$D_0 = D^{(:, J_0 \cap J)} \in \mathbb{C}^{n \times t_1}, \quad D_1 = D^{(:, J_0 \setminus J)} \in \mathbb{C}^{n \times (s_1 - t_1)}, \quad D_2 = D^{(:, J \setminus J_0)} \in \mathbb{C}^{n \times (s_1 - t_1)},$$

$$E_0 = E^{(:, K_0 \cap K)} \in \mathbb{C}^{n \times t_2}, \quad E_1 = E^{(:, K_0 \setminus K)} \in \mathbb{C}^{n \times (s_2 - t_2)}, \quad E_2 = E^{(:, K \setminus K_0)} \in \mathbb{C}^{n \times (s_2 - t_2)}.$$

By Lemma 3.3, if $n \ge 2 s_1 s_2$, then for almost all $D$ and $E$, the columns of $G_{D_0 E_0}$, $G_{D_1 E_0}$, $G_{D_2 E_0}$, $G_{D_0 E_1}$, $G_{D_1 E_1}$,
$G_{D_0 E_2}$, $G_{D_2 E_2}$ together form a linearly independent set. For every $(x_0, y_0)$ and $(x, y)$ such that the $s_1$-sparse
$x_0$ and $x$ are supported on $J_0$ and $J$ respectively, and the $s_2$-sparse $y_0$ and $y$ are supported on $K_0$ and $K$
respectively, if $\mathcal{G}_{DE}(x_0 y_0^T) = \mathcal{G}_{DE}(xy^T)$, then

$$G_{D_0 E_0} v_{00} + G_{D_1 E_0} v_{10} + G_{D_2 E_0} v_{20} + G_{D_0 E_1} v_{01} + G_{D_1 E_1} v_{11} + G_{D_0 E_2} v_{02} + G_{D_2 E_2} v_{22}
= G_{DE} v = \mathcal{G}_{DE}(x_0 y_0^T) - \mathcal{G}_{DE}(xy^T) = 0,$$

where $v = \mathrm{vec}(x_0 y_0^T - xy^T)$, $v_{00} = \mathrm{vec}(x_0^{(J_0 \cap J)} y_0^{(K_0 \cap K)T} - x^{(J_0 \cap J)} y^{(K_0 \cap K)T})$, $v_{10} = \mathrm{vec}(x_0^{(J_0 \setminus J)} y_0^{(K_0 \cap K)T})$,
$v_{20} = \mathrm{vec}(-x^{(J \setminus J_0)} y^{(K_0 \cap K)T})$, $v_{01} = \mathrm{vec}(x_0^{(J_0 \cap J)} y_0^{(K_0 \setminus K)T})$, $v_{11} = \mathrm{vec}(x_0^{(J_0 \setminus J)} y_0^{(K_0 \setminus K)T})$, $v_{02} = \mathrm{vec}(-x^{(J_0 \cap J)} y^{(K \setminus K_0)T})$,
$v_{22} = \mathrm{vec}(-x^{(J \setminus J_0)} y^{(K \setminus K_0)T})$. By linear independence, the vectors $v_{00}$, $v_{10}$, $v_{20}$, $v_{01}$, $v_{11}$, $v_{02}$, $v_{22}$ are all zero
vectors, and so is $v$. Hence for almost all $D$ and $E$, and all pairs $(x_0, y_0)$ and $(x, y)$ such that $x_0$ and $x$ are supported
on $J_0$ and $J$ respectively, and $y_0$ and $y$ are supported on $K_0$ and $K$ respectively, if $\mathcal{G}_{DE}(x_0 y_0^T) = \mathcal{G}_{DE}(xy^T)$, then
$x_0 y_0^T = xy^T$. Note that the supports $J_0$, $J$, $K_0$, $K$ are arbitrary, and there is only a finite number ($\binom{m_1}{s_1}^2 \binom{m_2}{s_2}^2$)
of combinations of supports. Therefore, for almost all $D$ and $E$, every pair $(x_0, y_0)$ ($\|x_0\|_0 \le s_1$, $\|y_0\|_0 \le s_2$,
$x_0 \ne 0$, $y_0 \ne 0$) is identifiable up to scaling. □

Due to symmetry, we can derive another sufficient condition for the scenario where $u = Dx$ resides in an
$m_1$-dimensional subspace spanned by the columns of $D$, and $v = Ey$ is $s_2$-sparse over $E$.

For generic bases or frames, the above sample complexities $n \ge m_1 m_2$, $n \ge 2 s_1 m_2$ or $n \ge 2 s_1 s_2$ are sufficient.
These sample complexities are not optimal, since they are in terms of the number of nonzero entries in $x_0 y_0^T$,
instead of the number of degrees of freedom in $x_0$ and $y_0$. For example, in the scenario with subspace constraints,
Theorem 3.4 requires $n \ge m_1 m_2$ samples, versus the number of degrees of freedom, which is $m_1 + m_2 - 1$. However,
these results hold with essentially no assumptions on $D$ or $E$. They are the first algebraic sample complexities
for blind deconvolution.
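
To make the gap concrete, the following sketch tabulates the sufficient sample counts of Theorems 3.4 to 3.6 next to the number of degrees of freedom in the subspace case. The function names and the example dimensions are hypothetical and chosen only for illustration.

```python
def generic_bound(kind, a, b):
    """Sufficient number of samples for generic bases or frames (Theorems 3.4-3.6)."""
    if kind == "subspace":   # a = m1, b = m2
        return a * b
    if kind == "mixed":      # a = s1, b = m2
        return 2 * a * b
    if kind == "sparsity":   # a = s1, b = s2
        return 2 * a * b
    raise ValueError(kind)

def degrees_of_freedom(m1, m2):
    """Degrees of freedom of (x0, y0) up to scaling in the subspace case."""
    return m1 + m2 - 1

m1, m2 = 10, 8  # hypothetical subspace dimensions
subspace_n = generic_bound("subspace", m1, m2)
dof = degrees_of_freedom(m1, m2)
print(subspace_n, dof)  # 80 17
```

For these example dimensions the generic bound asks for 80 samples while only 17 degrees of freedom are present, which is the multiplicative versus additive gap discussed above.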


4 Blind Deconvolution with a Sub-band Structured Basis 

In this section, we consider the BD problem where the filter resides in a subspace spanned by a sub-band
structured basis. For this setup, using the general framework for bilinear inverse problems we introduced recently
in [19], and Proposition 2.3 above, we derive much stronger, essentially optimal sample complexity results.


Definition 4.1. Let $\tilde{E} = F_n E$, $E \in \mathbb{C}^{n \times m_2}$, and let $J_k$ denote the support of $\tilde{E}^{(:,k)}$ ($1 \le k \le m_2$). If

$$\widehat{J}_k := J_k \setminus \Big( \bigcup_{k' \ne k} J_{k'} \Big) \ne \emptyset \quad \text{for } 1 \le k \le m_2,$$

then we say $E$ forms a sub-band structured basis. The nonempty index set $\widehat{J}_k$ and its cardinality $\ell_k := |\widehat{J}_k|$ are
called the passband and the bandwidth of $\tilde{E}^{(:,k)}$, respectively.
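
Definition 4.1 translates directly into set operations. The sketch below computes passbands and bandwidths from a list of frequency supports; the supports are a made-up example with partial overlaps, indices are 0-based (unlike the 1-based convention of the text), and the function name `passbands` is an assumption.

```python
def passbands(supports):
    """For each support J_k, return J_k minus the union of all other supports."""
    result = []
    for k, J_k in enumerate(supports):
        others = set().union(*(J for i, J in enumerate(supports) if i != k))
        result.append(set(J_k) - others)
    return result

# Hypothetical DFT supports of three basis vectors (n = 8), overlapping at 3 and 5.
supports = [{0, 1, 2, 3}, {3, 4, 5}, {5, 6, 7}]

bands = passbands(supports)
bandwidths = [len(b) for b in bands]
is_sub_band_structured = all(bandwidths)  # every passband must be nonempty

# Largest m1 for which the sum of bandwidths is at least m1 + m2 - 1.
max_m1 = sum(bandwidths) - len(supports) + 1
print(bands, bandwidths, is_sub_band_structured, max_m1)
```

In this example the shared frequencies 3 and 5 belong to no passband, so the total bandwidth is 6 rather than 8.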




Like filters in a filter bank, the basis vectors of a sub-band structured basis are supported on different sub-bands
in the Fourier domain (Figure 1(a)). By Definition 4.1, the sub-bands may overlap partially. For each
sub-band, its passband consists of the frequency components (which need not be contiguous) that are not present
in any other sub-band. For example, in acoustic signal processing or communications, an equalizer that adjusts
the relative gains $y^{(k)}$ of different frequency components can be considered as a filter $v = Ey$ that resides in
a subspace with a sub-band structured basis. See Figure 1(b) for the DFTs of three different equalizers, and
Figure 2 for the filter bank implementation of an equalizer.




Figure 1: A sub-band structured basis, (a) DFTs of basis vectors, (b) Examples of frequency 
responses of filters in the span of the sub-band structured basis. 



Figure 2: Filter bank implementation of an equalizer. 

Next, we address the identifiability of the blind deconvolution problem where the filter resides in a subspace 
with a sub-band structured basis, and the signal resides in another subspace, or is sparse over some given 
dictionary. For example, consider the following blind deconvolution problem in channel encoding. An unknown 
source string x is encoded by a given tall-and-skinny matrix D and then transmitted through a channel whose 
gains in different sub-bands are unknown. Then the encoded string Dx resides in a subspace spanned by D , 
and the channel resides in a subspace with a sub-band structured basis. Simultaneous recovery of the channel 
and the encoded string from measurements of the channel output corresponds to blind deconvolution with a 
sub-band structured basis. Another example is the channel identification problem where the acoustic channel 
can be modeled as the serial concatenation of an equalizer and a multipath channel. The equalizer has known
sub-bands but unknown gains. The multipath channel can be modeled as a sparse filter. Then the simultaneous 
recovery of the sparse multipath channel and the equalizer from the given input and measured output of the 
channel corresponds to blind deconvolution with a sub-band structured basis. 

We consider first the case of subspace constraints, with one of the bases having a sub-band structure.

Theorem 4.2. In (BD) with subspace constraints, suppose $E$ forms a sub-band structured basis, $x_0 \in \mathbb{C}^{m_1}$ is
nonzero and $y_0 \in \mathbb{C}^{m_2}$ is non-vanishing. If the sum of all the bandwidths $\sum_{k=1}^{m_2} \ell_k \ge m_1 + m_2 - 1$, then for
almost all $D \in \mathbb{C}^{n \times m_1}$, the pair $(x_0, y_0) \in \Omega_X \times \Omega_Y$ is identifiable up to scaling.

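Proof. Let $\tilde{D} = F_n D$, $\tilde{E} = F_n E$. By the sub-band structure assumption, $\tilde{E}$ has full column rank. For nonzero
$x_0$ and for almost all $D$, all the entries of $\tilde{D} x_0$ are nonzero and the matrix $\mathrm{diag}(\tilde{D} x_0) \tilde{E}$ has full column rank. If
there exists $y \in \Omega_Y$ such that $(Dx_0) \circledast (Ey) = (Dx_0) \circledast (Ey_0)$, then

$$\mathrm{diag}(\tilde{D} x_0) \tilde{E} y = (\tilde{D} x_0) \odot (\tilde{E} y) = (\tilde{D} x_0) \odot (\tilde{E} y_0) = \mathrm{diag}(\tilde{D} x_0) \tilde{E} y_0.$$

It follows that $y = y_0$. By Proposition 2.3, to complete the proof, we only need to show that if there exists
$(x, y) \in \Omega_X \times \Omega_Y$ such that $(Dx) \circledast (Ey) = (Dx_0) \circledast (Ey_0)$, then $x = \sigma x_0$ for some nonzero $\sigma$.

If there exists $(x, y) \in \Omega_X \times \Omega_Y$ such that $(Dx) \circledast (Ey) = (Dx_0) \circledast (Ey_0)$, we have

$$\mathrm{diag}(\tilde{E} y) \tilde{D} x = \mathrm{diag}(\tilde{E} y_0) \tilde{D} x_0.$$

Considering the passband $\widehat{J}_k$, we have

$$\mathrm{diag}(\tilde{E}^{(\widehat{J}_k,:)} y) \tilde{D}^{(\widehat{J}_k,:)} x = \mathrm{diag}(\tilde{E}^{(\widehat{J}_k,:)} y_0) \tilde{D}^{(\widehat{J}_k,:)} x_0,$$

or equivalently

$$\mathrm{diag}(\tilde{E}^{(\widehat{J}_k,k)}) \tilde{D}^{(\widehat{J}_k,:)} x y^{(k)} = \mathrm{diag}(\tilde{E}^{(\widehat{J}_k,k)}) \tilde{D}^{(\widehat{J}_k,:)} x_0 y_0^{(k)},$$
$$\tilde{D}^{(\widehat{J}_k,:)} x y^{(k)} = \tilde{D}^{(\widehat{J}_k,:)} x_0 y_0^{(k)}.$$

By assumption, $y_0^{(k)} \ne 0$. For almost all $D$, $\tilde{D}^{(\widehat{J}_k,:)} x_0 \ne 0$. Hence $y^{(k)} \ne 0$, $x \ne 0$. It follows that

$$\tilde{D}^{(\widehat{J}_k,:)} \Big( x - \frac{y_0^{(k)}}{y^{(k)}} x_0 \Big) = 0.$$

Hence

$$x \in \mathcal{N}(\tilde{D}^{(\widehat{J}_k,:)}) + \mathrm{span}(x_0). \qquad (1)$$

Let $x_0^\perp$ denote the orthogonal complement of $\mathrm{span}(x_0)$. Then

$$P_{x_0^\perp} x = x - P_{\mathrm{span}(x_0)} x \in \mathcal{N}(\tilde{D}^{(\widehat{J}_k,:)}) + \mathrm{span}(x_0).$$

Hence

$$P_{x_0^\perp} x \in x_0^\perp \cap \big[ \mathcal{N}(\tilde{D}^{(\widehat{J}_k,:)}) + \mathrm{span}(x_0) \big] = x_0^\perp \cap \big( \mathcal{R}(\tilde{D}^{(\widehat{J}_k,:)*}) \cap x_0^\perp \big)^\perp, \quad \forall k \in \{1, 2, \cdots, m_2\}, \qquad (2)$$

where $(\cdot)^*$ denotes the conjugate transpose. The equation holds due to the fact that, for linear vector spaces $\mathcal{V}_1$
and $\mathcal{V}_2$, $\mathcal{V}_1 + \mathcal{V}_2 = (\mathcal{V}_1^\perp \cap \mathcal{V}_2^\perp)^\perp$.

Now, for almost all $D$, $\mathcal{R}(\tilde{D}^{(\widehat{J}_k,:)*})$ is a generic $\ell_k$-dimensional subspace of $\mathbb{C}^{m_1}$, and $\mathcal{R}(\tilde{D}^{(\widehat{J}_k,:)*}) \not\subset x_0^\perp$. Hence
there exists a generic $(\ell_k - 1)$-dimensional subspace $\mathcal{V}_k \subset x_0^\perp$ such that

$$\mathcal{R}(\tilde{D}^{(\widehat{J}_k,:)*}) = \mathcal{V}_k \oplus \mathrm{span}\big( P_{\mathcal{R}(\tilde{D}^{(\widehat{J}_k,:)*})} x_0 \big),$$
$$\mathcal{R}(\tilde{D}^{(\widehat{J}_k,:)*}) \cap x_0^\perp = \mathcal{V}_k.$$

Therefore, (2) is equivalent to

$$P_{x_0^\perp} x \in x_0^\perp \cap \mathcal{V}_1^\perp \cap \mathcal{V}_2^\perp \cap \cdots \cap \mathcal{V}_{m_2}^\perp = \Big( \mathrm{span}(x_0) + \sum_{k=1}^{m_2} \mathcal{V}_k \Big)^\perp,$$

where $\mathcal{V}_1, \mathcal{V}_2, \cdots, \mathcal{V}_{m_2}$ are generic subspaces of $x_0^\perp$, the dimensions of which are $\ell_1 - 1, \ell_2 - 1, \cdots, \ell_{m_2} - 1$. For
any such generic subspaces of $x_0^\perp$, if $\sum_{k=1}^{m_2} \ell_k \ge m_1 + m_2 - 1$, i.e., $\sum_{k=1}^{m_2} (\ell_k - 1) \ge m_1 - 1$, then

$$\sum_{k=1}^{m_2} \mathcal{V}_k = x_0^\perp.$$

Hence

$$\mathrm{span}(x_0) + \sum_{k=1}^{m_2} \mathcal{V}_k = \mathbb{C}^{m_1},$$
$$P_{x_0^\perp} x \in \Big( \mathrm{span}(x_0) + \sum_{k=1}^{m_2} \mathcal{V}_k \Big)^\perp = \{0\}.$$

Therefore, $P_{x_0^\perp} x = 0$, or $x \in \mathrm{span}(x_0)$. We have shown that $x \ne 0$, hence there exists a nonzero $\sigma \in \mathbb{C}$ such that
$x = \sigma x_0$. The proof is complete. □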
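
The first step of the proof (uniqueness of y once x0 is fixed) can be illustrated in the frequency domain. The sketch below assumes, for simplicity, that the supports partition the frequency range, so that each coefficient of y can be read off from any frequency in its passband. The arrays `E_hat`, `u_hat` (standing for the DFT of D x0, assumed non-vanishing) and `y0` are hypothetical values, and indices are 0-based.

```python
import math

n = 6
# DFT of a sub-band structured basis with two columns; rows are frequencies.
E_hat = [[1, 0], [2j, 0], [-1, 0], [0, 1 + 1j], [0, 3], [0, -2j]]
passband = [[0, 1, 2], [3, 4, 5]]    # passband of each column
u_hat = [1, -2, 1j, 3, 0.5, 1 - 1j]  # stands for the DFT of D x0, all entries nonzero
y0 = [2 - 1j, 0.5j]                  # unknown gains to be recovered

# Measurement in the frequency domain: z_hat = sqrt(n) * u_hat .* (E_hat y0).
v_hat = [sum(E_hat[i][k] * y0[k] for k in range(2)) for i in range(n)]
z_hat = [math.sqrt(n) * u_hat[i] * v_hat[i] for i in range(n)]

# On passband k only column k of E_hat is active, so entrywise division
# by the known factors isolates the k-th gain.
y_rec = []
for k, band in enumerate(passband):
    i = band[0]  # any frequency in the passband works
    y_rec.append(z_hat[i] / (math.sqrt(n) * u_hat[i] * E_hat[i][k]))

y_err = max(abs(a - b) for a, b in zip(y_rec, y0))
print(y_err)
```

The division is only possible because every entry of `u_hat` is nonzero, which mirrors the requirement in the proof that all entries of the DFT of D x0 be nonzero for almost all D.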


We turn next to the case of blind deconvolution with mixed constraints, where the signal lives in a subspace 
spanned by a sub-band structured basis, and the filter is sparse. 

Theorem 4.3. In (BD) with mixed constraints, suppose $E$ forms a sub-band structured basis, $x_0 \in \mathbb{C}^{m_1}$ satisfies
that $\|x_0\|_0 \le s_1$ and $x_0 \ne 0$, and $y_0 \in \mathbb{C}^{m_2}$ is non-vanishing. If the sum of all the bandwidths $\sum_{k=1}^{m_2} \ell_k \ge
2 s_1 + m_2 - 1$, then for almost all $D \in \mathbb{C}^{n \times m_1}$, the pair $(x_0, y_0) \in \Omega_X \times \Omega_Y$ is identifiable up to scaling.


Proof. The proof is very similar to that of Theorem 4.2. For nonzero $x_0$ and almost all $D$, if there exists $y \in \Omega_Y$
such that $(Dx_0) \circledast (Ey) = (Dx_0) \circledast (Ey_0)$, then $y = y_0$. By Proposition 2.3, to complete the proof, we only need
to show that if there exists $(x, y) \in \Omega_X \times \Omega_Y$ such that $\|x\|_0 \le s_1$ and $(Dx) \circledast (Ey) = (Dx_0) \circledast (Ey_0)$, then
$x = \sigma x_0$ for some nonzero $\sigma$.

Denote the support of $x_0$ by $K_0$, $|K_0| = s_1$. If there exists $(x, y) \in \Omega_X \times \Omega_Y$ such that $x$ is supported on $K$,
$|K| = s_1$, and $(Dx) \circledast (Ey) = (Dx_0) \circledast (Ey_0)$, then

$$\mathrm{diag}(\tilde{E} y) \tilde{D}^{(:, K_0 \cup K)} x^{(K_0 \cup K)} = \mathrm{diag}(\tilde{E} y_0) \tilde{D}^{(:, K_0 \cup K)} x_0^{(K_0 \cup K)}.$$

In this case, (1) and (2) in the proof of Theorem 4.2 become

$$x^{(K_0 \cup K)} \in \mathcal{N}(\tilde{D}^{(\widehat{J}_k, K_0 \cup K)}) + \mathrm{span}(x_0^{(K_0 \cup K)}),$$

$$P_{x_0^{(K_0 \cup K)\perp}} x^{(K_0 \cup K)} \in x_0^{(K_0 \cup K)\perp} \cap \big( \mathcal{R}(\tilde{D}^{(\widehat{J}_k, K_0 \cup K)*}) \cap x_0^{(K_0 \cup K)\perp} \big)^\perp, \quad \forall k \in \{1, 2, \cdots, m_2\}.$$

Since $|K_0| = |K| = s_1$, we have $|K_0 \cup K| \le 2 s_1$. If $\sum_{k=1}^{m_2} \ell_k \ge 2 s_1 + m_2 - 1$, then by an argument analogous
to that in the proof of Theorem 4.2, we have that for almost all $D$, $P_{x_0^{(K_0 \cup K)\perp}} x^{(K_0 \cup K)}$ must be 0. Therefore,
there exists a nonzero $\sigma \in \mathbb{C}$ such that $x = \sigma x_0$.

We complete the proof by enumerating all supports $K$ of cardinality $s_1$. Since there is only a finite number
($\binom{m_1}{s_1}$) of such supports, for almost all $D$, if there exists $(x, y)$ such that $x$ is $s_1$-sparse and $(Dx) \circledast (Ey) =
(Dx_0) \circledast (Ey_0)$, then $x = \sigma x_0$ for some nonzero $\sigma$. □

Figure 3: A sub-band structured basis with supports $J_k$ that partition the DFT frequency range.
(a) DFTs of basis vectors. (b) Examples of frequency responses of filters in the span of the basis.


How do the sufficient conditions of Theorems 4.2 and 4.3 compare to the minimal required sample complexities?
We address this question for the following scenario. Suppose that the supports $J_k$ ($1 \le k \le m_2$) form a
partition of the frequency range, i.e.,

$$J_{k_1} \cap J_{k_2} = \emptyset \text{ for all } k_1 \text{ and } k_2 \text{ such that } k_1 \ne k_2; \qquad \bigcup_{1 \le k \le m_2} J_k = \{1, 2, \cdots, n\}.$$

In this case the passbands are $\widehat{J}_k = J_k$ and $n = \sum_{k=1}^{m_2} \ell_k$. For example, this scenario applies when the filter
bank is an array of ideal bandpass filters whose passbands partition the DFT frequency range (see Figure 3).
Consider first (BD) with subspace constraints. Under the above scenario, the sufficient condition in Theorem
4.2 becomes $n \ge m_1 + m_2 - 1$. Next, we show that this sample complexity is also necessary.


Proposition 4.4. In (BD) with subspace constraints, suppose $E$ forms a sub-band structured basis, for which the
supports $J_k$ ($1 \le k \le m_2$) are disjoint and cover all the frequency components. If $(x_0, y_0)$ ($y_0$ is non-vanishing)
is identifiable up to scaling, then $n \ge m_1 + m_2 - 1$.

We turn next to (BD) with mixed constraints. Under the assumption that the passbands partition the DFT
frequency range, the sufficient condition in Theorem 4.3 becomes $n \ge 2 s_1 + m_2 - 1$. Next, we show that this is
almost necessary.

Corollary 4.5. In (BD) with mixed constraints, suppose $E$ forms a sub-band structured basis, for which the
supports $J_k$ ($1 \le k \le m_2$) are disjoint and cover all the frequency components. If $(x_0, y_0)$ ($x_0$ is $s_1$-sparse, $y_0$ is
non-vanishing) is identifiable up to scaling, then $n \ge s_1 + m_2 - 1$.

The sample complexities in the sufficient conditions match those in the necessary conditions (exactly for (BD) with subspace constraints, and up to the gap between $2s_1$ and $s_1$ for (BD) with mixed constraints), hence they are optimal or nearly optimal. The sample complexities are also optimal in the sense that the number of degrees of freedom is roughly equal to the number of measurements. We give the proofs of Proposition 4.4 and Corollary 4.5 in Appendix B.
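
To make the comparison concrete, the thresholds stated in this paper can be tabulated for given dimensions. The following Python sketch is only a bookkeeping aid; the function names and the example values of `m1`, `m2`, `s1` are hypothetical, and the sub-band thresholds assume the partition scenario described above.

```python
def generic_subspace_sufficient(m1, m2):
    """Sufficient number of samples for generic bases: n >= m1 * m2."""
    return m1 * m2


def subband_subspace_threshold(m1, m2):
    """Sub-band structured basis, subspace constraints: n >= m1 + m2 - 1.

    In the partition scenario this bound is both sufficient and necessary.
    """
    return m1 + m2 - 1


def subband_mixed_sufficient(s1, m2):
    """Sub-band structured basis, mixed constraints: n >= 2 * s1 + m2 - 1 suffices."""
    return 2 * s1 + m2 - 1


def subband_mixed_necessary(s1, m2):
    """Sub-band structured basis, mixed constraints: n >= s1 + m2 - 1 is necessary."""
    return s1 + m2 - 1


m1, m2, s1 = 8, 4, 3  # hypothetical dimensions and sparsity level
print("generic, subspace:      n >=", generic_subspace_sufficient(m1, m2))
print("sub-band, subspace:     n >=", subband_subspace_threshold(m1, m2))
print("sub-band, mixed (suff): n >=", subband_mixed_sufficient(s1, m2))
print("sub-band, mixed (nec):  n >=", subband_mixed_necessary(s1, m2))
```

For mixed constraints, the gap between the sufficient and necessary thresholds is exactly $s_1$, which is the price of not knowing the support.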

5 Conclusions 

We studied the identifiability of blind deconvolution problems with subspace or sparsity constraints. We derived two algebraic conditions on blind deconvolution with subspace constraints. We first showed, using the lifting framework, that blind deconvolution from $n$ observations with generic bases of dimensions $m_1$ and $m_2$ is identifiable up to scaling given that $n \ge m_1 m_2$. Then we applied the general framework in [19] to show that blind deconvolution with a sub-band structured basis is identifiable up to scaling given that $n \ge m_1 + m_2 - 1$. The second result was shown to be tight. These results are also generalized to blind deconvolution with sparsity constraints or mixed constraints, with sparsity level(s) replacing the subspace dimension(s). The cost for the unknown support in the case of sparsity constraints is an extra factor of 2 in the sample complexity.

We acknowledge that the results in Section 3 for generic bases may not be optimal. But they provide the first algebraic conditions for feasibility of blind deconvolution with subspace or sparsity priors. Furthermore, taking advantage of the interesting sub-band structure of some bases (such as filters in a filter bank implementation of equalizers), we achieved sample complexities that are essentially optimal. Our results are derived for generic bases or frames, which means that they can fail only on a set of bases or frames of Lebesgue measure zero.

An interesting question is whether, without the sub-band structure, it is possible to provide an algebraic analysis of blind deconvolution that achieves optimal sample complexities. Other ongoing research topics include identifiability in blind deconvolution with specific bases or frames that arise in applications.

A Proofs of Lemmas 3.1, 3.2 and 3.3

Proof of Lemma 3.1. The entries of $G_{DE}$ are multivariate polynomials in the entries of $D$ and $E$, or to be more specific, quadratic forms in the entries of $D$ and $E$. By Lemma 1 from [21], the matrix $G_{DE}$ has full column rank for almost all $D$ and $E$ if it has full column rank for at least one choice of $D$ and $E$.

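We complete the proof by showing that $G_{DE}$ has full column rank for the following choice of $D$ and $E$. Let $D = F^{-1}\tilde{D}$, with $\tilde{D} \in \mathbb{C}^{n \times m_1}$ chosen such that all its submatrices have full rank. (For example, this holds with probability 1 for a random matrix with i.i.d. Gaussian entries.) Let $E = F^{-1}\tilde{E}$, with $\tilde{E} \in \mathbb{C}^{n \times m_2}$ chosen such that its first $m_1 m_2$ rows are the Kronecker product

$$\tilde{E}^{(1:m_1 m_2,:)} = I_{m_2} \otimes 1_{m_1,1}.$$

Let $\tilde{G}_{DE} = F G_{DE}$; then the submatrix containing the first $m_1 m_2$ rows of $\tilde{G}_{DE}/\sqrt{n}$ is

$$\frac{1}{\sqrt{n}}\,\tilde{G}_{DE}^{(1:m_1 m_2,:)} = \begin{bmatrix} \tilde{D}^{(1:m_1,:)} & & & \\ & \tilde{D}^{(m_1+1:2m_1,:)} & & \\ & & \ddots & \\ & & & \tilde{D}^{((m_2-1)m_1+1:m_2 m_1,:)} \end{bmatrix}.$$

By the assumption that all submatrices of $\tilde{D}$ have full rank, it follows that $\tilde{G}_{DE}^{(1:m_1 m_2,:)}/\sqrt{n}$ has full column rank $m_1 m_2$. Therefore, $G_{DE}$ has full column rank. □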
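
The rank argument above can be checked numerically on a small instance. The following Python sketch is an illustration under stated assumptions, not the construction used in the paper: it assumes the lifting convention in which each row of the DFT-domain matrix is, up to the factor $\sqrt{n}$, the Kronecker product of the corresponding rows of $\tilde{E}$ and $\tilde{D}$. The helper names `kron_row` and `rank`, the dimensions, and the random seed are all hypothetical.

```python
import random


def kron_row(e_row, d_row):
    """Kronecker product of two row vectors, e_row (x) d_row."""
    return [e * d for e in e_row for d in d_row]


def rank(rows, tol=1e-9):
    """Numerical rank via Gaussian elimination with partial pivoting."""
    A = [list(r) for r in rows]
    n_rows, n_cols = len(A), len(A[0])
    r = 0
    for c in range(n_cols):
        if r == n_rows:
            break
        piv = max(range(r, n_rows), key=lambda i: abs(A[i][c]))
        if abs(A[piv][c]) < tol:
            continue  # no usable pivot in this column
        A[r], A[piv] = A[piv], A[r]
        for i in range(r + 1, n_rows):
            f = A[i][c] / A[r][c]
            A[i] = [a - f * b for a, b in zip(A[i], A[r])]
        r += 1
    return r


m1, m2 = 3, 2            # hypothetical subspace dimensions
n_rows = m1 * m2         # only the first m1 * m2 DFT rows are needed
rng = random.Random(0)

# First m1*m2 rows of D-tilde: i.i.d. complex Gaussian entries.
D_t = [[complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(m1)]
       for _ in range(n_rows)]
# First m1*m2 rows of E-tilde: I_{m2} (x) 1_{m1,1}.
E_t = [[1.0 if k == j // m1 else 0.0 for k in range(m2)] for j in range(n_rows)]

# Rows of the lifted DFT-domain matrix (the sqrt(n) factor is dropped).
G_t = [kron_row(E_t[j], D_t[j]) for j in range(n_rows)]
print(rank(G_t), "of", m1 * m2, "columns")
```

With this choice of $\tilde{E}$, the matrix `G_t` is block diagonal with $m_2$ blocks of size $m_1 \times m_1$ taken from `D_t`, so its rank should equal $m_1 m_2$ whenever those blocks are invertible.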


Proof of Lemma 3.2. Let $D = [D_0, D_1, D_2]$, then $G_{DE}$ is a permutation of the columns of $[G_{D_0 E}, G_{D_1 E}, G_{D_2 E}]$. It is sufficient to prove that $G_{DE}$ has full column rank, which follows from Lemma 3.1 because the number of columns in $D$ is $m_1 = t_1 + 2(s_1 - t_1) = 2s_1 - t_1 \le 2s_1$ and $n \ge 2 s_1 m_2 \ge m_1 m_2$. □


Proof of Lemma 3.3. Let $D = [D_0, D_1, D_2]$, $D' = [D_0, D_1]$ and $D'' = [D_0, D_2]$, then $[G_{DE_0}, G_{D'E_1}, G_{D''E_2}]$ is a permutation of all the columns of $G_{D_0E_0}$, $G_{D_1E_0}$, $G_{D_2E_0}$, $G_{D_0E_1}$, $G_{D_1E_1}$, $G_{D_0E_2}$, $G_{D_2E_2}$. By Lemma 1 from [21], it is sufficient to show that $[G_{DE_0}, G_{D'E_1}, G_{D''E_2}]$ has full column rank for at least one choice of $D_0, D_1, D_2, E_0, E_1, E_2$.

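We complete the proof by showing that $[G_{DE_0}, G_{D'E_1}, G_{D''E_2}]$ has full column rank for the following choice. Let $\tilde{D}_0, \tilde{D}_1, \tilde{D}_2$ be chosen such that all submatrices of $\tilde{D}_0, \tilde{D}_1, \tilde{D}_2$ have full rank. Let $\tilde{E}_0, \tilde{E}_1, \tilde{E}_2$ be chosen such that the first $2 s_1 s_2$ rows of $\tilde{E}_0, \tilde{E}_1, \tilde{E}_2$ are

$$\tilde{E}_0^{(1:2s_1s_2,:)} = \begin{bmatrix} I_{t_2} \otimes 1_{2s_1,1} \\ 0_{s_1(s_2-t_2),\,t_2} \\ 0_{s_1(s_2-t_2),\,t_2} \end{bmatrix}, \quad
\tilde{E}_1^{(1:2s_1s_2,:)} = \begin{bmatrix} 0_{2s_1t_2,\,s_2-t_2} \\ I_{s_2-t_2} \otimes 1_{s_1,1} \\ 0_{s_1(s_2-t_2),\,s_2-t_2} \end{bmatrix}, \quad
\tilde{E}_2^{(1:2s_1s_2,:)} = \begin{bmatrix} 0_{2s_1t_2,\,s_2-t_2} \\ 0_{s_1(s_2-t_2),\,s_2-t_2} \\ I_{s_2-t_2} \otimes 1_{s_1,1} \end{bmatrix}.$$

By the proofs of Lemmas 3.1 and 3.2, $\tilde{G}_{DE_0}^{(1:2s_1s_2,:)}$, $\tilde{G}_{D'E_1}^{(1:2s_1s_2,:)}$ and $\tilde{G}_{D''E_2}^{(1:2s_1s_2,:)}$ all have full column rank, and their nonzero entries are located in three disjoint row blocks. Hence $[\tilde{G}_{DE_0}, \tilde{G}_{D'E_1}, \tilde{G}_{D''E_2}]^{(1:2s_1s_2,:)}$ has full column rank. Therefore, $[G_{DE_0}, G_{D'E_1}, G_{D''E_2}]$ has full column rank. □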


B Proofs of the Necessary Conditions 


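Proof of Proposition 4.4. We show that if $n < m_1 + m_2 - 1$, then $(x_0, y_0)$ is not identifiable up to scaling. Here $\tilde{D} = FD$ and $\tilde{E} = FE$ denote the bases in the DFT domain. Let $\tilde{D}_\perp \in \mathbb{C}^{n \times (n - m_1)}$ denote a matrix whose columns form a basis for the orthogonal complement of the column space of $\tilde{D}$. Then $\tilde{D}_\perp^*$ is an annihilator of the column space of $\tilde{D}$, i.e., $\tilde{D}_\perp^* \tilde{D} = 0$. Let $\tilde{E}_{\mathrm{inv}} \in \mathbb{C}^{n \times m_2}$ denote the entrywise inverse of $\tilde{E}$:

$$\tilde{E}_{\mathrm{inv}}^{(j,k)} = \begin{cases} 1/\tilde{E}^{(j,k)} & \text{if } \tilde{E}^{(j,k)} \neq 0, \\ 0 & \text{if } \tilde{E}^{(j,k)} = 0. \end{cases}$$

Consider the linear operator $\mathcal{G} : \mathbb{C}^{m_2} \to \mathbb{C}^{n - m_1}$ defined by

$$\mathcal{G}(w) = \tilde{D}_\perp^* \operatorname{diag}(\tilde{E}_{\mathrm{inv}} w) \operatorname{diag}(\tilde{E} y_0) \tilde{D} x_0$$

for $w \in \mathbb{C}^{m_2}$. We claim that every non-vanishing null vector of $\mathcal{G}$ produces a solution to the BD problem. Indeed, if $w_1 \in \mathcal{N}(\mathcal{G})$ is non-vanishing, then $\operatorname{diag}(\tilde{E}_{\mathrm{inv}} w_1) \operatorname{diag}(\tilde{E} y_0) \tilde{D} x_0$ is annihilated by $\tilde{D}_\perp^*$ and therefore must reside in the column space of $\tilde{D}$. Hence, there exists $x_1 \in \mathbb{C}^{m_1}$ such that

$$\operatorname{diag}(\tilde{E}_{\mathrm{inv}} w_1) \operatorname{diag}(\tilde{E} y_0) \tilde{D} x_0 = \tilde{D} x_1. \qquad (3)$$

Now, let $y_1$ denote the entrywise inverse of $w_1$. Recall that the supports of the columns of $\tilde{E}$ are disjoint and cover all the frequency components, hence $\tilde{E} y_1$ is the entrywise inverse of $\tilde{E}_{\mathrm{inv}} w_1$. By Equation (3),

$$\operatorname{diag}(\tilde{E} y_0) \tilde{D} x_0 = \operatorname{diag}(\tilde{E} y_1) \tilde{D} x_1,$$

$$(D x_0) \circledast (E y_0) = (D x_1) \circledast (E y_1).$$

Hence $(x_1, y_1)$ is a solution to the BD problem where $z = (D x_0) \circledast (E y_0)$. This establishes the claim.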

It remains to show that $\mathcal{G}$ does have a non-vanishing null vector, and that the solution it produces does not coincide, up to scaling, with $(x_0, y_0)$. Let $w_0$ denote the entrywise inverse of $y_0$, then $w_0 \in \mathcal{N}(\mathcal{G})$. There are $(n - m_1)$ equations in $\mathcal{G}(w) = 0$. If $n < m_1 + m_2 - 1$, then $n - m_1 \le m_2 - 2$ and the dimension of $\mathcal{N}(\mathcal{G})$ is at least 2. Hence, there exists a vector $w_1 \in \mathcal{N}(\mathcal{G})$ such that $w_0, w_1$ are linearly independent. Let $a$ be a complex number such that $0 < |a| < \frac{1}{\|y_0\|_\infty \|w_1\|_\infty}$. Then $w_0 + a w_1 \in \mathcal{N}(\mathcal{G})$ is non-vanishing, because the entries of $w_0 + a w_1$ satisfy

$$\left| w_0^{(j)} + a w_1^{(j)} \right| \ge \left| w_0^{(j)} \right| - |a| \left| w_1^{(j)} \right| \ge \frac{1}{\|y_0\|_\infty} - |a| \, \|w_1\|_\infty > 0, \quad \text{for } j = 1, 2, \dots, m_2.$$

Since $a \neq 0$ and $w_0, w_1$ are linearly independent, the null vector $w_0 + a w_1$ is not a scaled version of $w_0$. It produces a solution $(x_2, y_2)$ in which $y_2$ is the entrywise inverse of $w_0 + a w_1$ and is not a scaled version of $y_0$. Therefore, $(x_0, y_0)$ is not identifiable up to scaling. □
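
The construction in this proof can be checked on a toy instance. The following Python sketch works directly in the DFT domain, where circular convolution becomes an entrywise product, and uses hypothetical values $n = 2$, $m_1 = m_2 = 2$, so that $n < m_1 + m_2 - 1$. The matrices `D_t`, `E_t`, the vectors, and the helper names are illustrative assumptions. Since $n - m_1 = 0$ here, the operator imposes no equations and every non-vanishing $w$ is a null vector.

```python
def matvec(A, x):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]


def hadamard(u, v):
    """Entrywise product of two vectors."""
    return [a * b for a, b in zip(u, v)]


def solve2(A, b):
    """Solve a 2x2 complex linear system A x = b by Cramer's rule."""
    (a11, a12), (a21, a22) = A
    det = a11 * a22 - a12 * a21
    return [(b[0] * a22 - a12 * b[1]) / det, (a11 * b[1] - a21 * b[0]) / det]


# Hypothetical DFT-domain bases: D_t is invertible, E_t has disjoint supports {1}, {2}.
D_t = [[1 + 0j, 2 + 0j], [3 + 0j, 5 + 0j]]
E_t = [[2 + 0j, 0j], [0j, 4 + 0j]]
x0 = [1 + 0j, 1 + 0j]
y0 = [1 + 0j, 2 + 0j]  # non-vanishing

# Entrywise inverse of E_t, with zeros kept as zeros.
E_inv = [[1 / e if e != 0 else 0j for e in row] for row in E_t]

# DFT of the measurement, up to normalization.
z_t = hadamard(matvec(D_t, x0), matvec(E_t, y0))

# A non-vanishing w1 that is not a multiple of w0 = 1 / y0 = [1, 0.5].
w1 = [1 + 0j, 1 + 0j]
x1 = solve2(D_t, hadamard(matvec(E_inv, w1), z_t))  # Equation (3)
y1 = [1 / w for w in w1]

z1_t = hadamard(matvec(D_t, x1), matvec(E_t, y1))
print("same measurement:", all(abs(a - b) < 1e-9 for a, b in zip(z_t, z1_t)))
print("x1 =", x1, "y1 =", y1)
```

In this instance the pair `(x1, y1)` reproduces the same DFT-domain measurement as `(x0, y0)` although neither `x1` nor `y1` is a scaled copy of `x0` or `y0`, which is the non-identifiability the proposition predicts.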

Proof of Corollary 4.5. The vector $x_0$ is $s_1$-sparse. If we know the support of $x_0$, then the signal $u_0 = D x_0$ resides in a subspace spanned by $s_1$ columns of $D$ and the problem reduces to BD with subspace constraints. By Proposition 4.4, if $n < s_1 + m_2 - 1$, then $(x_0, y_0)$ cannot be identified up to scaling even if the support of $x_0$ is given. Hence $(x_0, y_0)$ is not identifiable without knowing the support. Therefore, it is necessary that $n \ge s_1 + m_2 - 1$. □